Encoding/decoding method and apparatus with vector derivation mode

ABSTRACT

An encoding method, a decoding method, an encoding apparatus, a decoding apparatus, for a video image. The encoding method includes: determining an optimal integrated candidate block for a current block based on a motion vector integration technology; determining, based a prediction direction of the optimal integrated candidate block, a motion vector derivation mode that needs to be used by a decoder; correcting a motion vector of the current block based on the motion vector derivation mode; and determining a residual between a predicted value and an original value of the current block based on the corrected motion vector, thereby encoding the current block. According to the technical solutions, a more accurate predicted value is obtained by correcting the motion vector, and a smaller residual is generated, thereby improving encoding efficiency, avoiding an increase in data bandwidth, improving decoding quality, and reducing calculation complexity.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of International Application No.PCT/CN2011/081497, filed on Oct. 28, 2011, which claims priority toChinese Patent Application No. 201110057554.1, filed on Mar. 10, 2011,both of which are hereby incorporated by reference in their entireties.

TECHNICAL FIELD

The present invention relates to the image processing field, and inparticular, to an encoding/decoding method, an encoding apparatus, adecoding apparatus, and a system for a video image in the imageprocessing field.

BACKGROUND

Motion estimation and motion compensation are important technologies forvideo compression. A considerable part of bits in a compressed videobit-stream are used to transmit motion vector information. Motioninformation of a block may include a prediction direction, a motionvector, and reference frame information of the block. The referenceframe information may represent a frame or frames that are referencedfor obtaining the block. In cases of a low bit rate, especially for ahigh-definition video, bits consumed to transmit motion vectorinformation generally account for over 50% of the total number of bitsin a bitstream. Therefore, it is very important to efficiently encodethe motion information.

At present, inter-frame prediction technologies for improving motionprediction/compensation mainly include a decoder motion vectorderivation technology and a motion vector integration technology.

In the decoder motion vector derivation technology, an encoder and adecoder need to process a block by using the same search/derivationmechanism in a large range. With respect to a block that is encoded bythe encoder according to this technology, the decoder restores the blockby using a specific search/derivation mechanism in a large range. Forexample, by fully using correlation between a block and an adjacentencoded (or decoded) region in a current frame where the block islocated, or using correlation between a block and previous and nextadjacent encoded (or decoded) frames of the current frame where theblock is located, decoding is performed by performing search/derivationin a large range. This method may achieve high compression performance;however, the encoder and the decoder need to perform search/derivationin a large range of image frames, which is highly complex incalculation.

In the motion vector integration technology, by using strong correlationbetween a motion vector of a current block and a motion vector of aneighboring block, a motion vector of a closest neighboring block may beused to replace the motion vector of the current block, so that an indexvalue of the neighboring block is used to replace the motion vector ofthe current block during encoding to reduce encoded bits, where theneighboring block is an optimal integrated candidate block of thecurrent block. A neighboring block of the current block refers to ablock whose position is any one of the following positions:corresponding positions in previous and next adjacent frames of thecurrent block, an upper side of the current block, a left side of thecurrent block, an upper-left corner of the current block, and anupper-right corner of the current block. In the prior art, circumstancesunder which the motion vector integration technology may be used aredisclosed, and how to determine the optimal integrated candidate blockof the current block is also disclosed. In a case where the motionvector integration technology can be used, an encoder determines one ofneighboring blocks as an optimal integrated candidate block for acurrent block, uses a motion vector thereof to replace a motion vectorof the current block, and transmits an index value of positioninformation of the optimal integrated candidate block as the encodinginformation of the current block to a decoder; the decoder determinesthe motion vector of the current block according to the index value tofind a reference block of the current block, thereby determining apredicted value of motion compensation of the current block.

In the motion vector integration technology, by using the encoding indexvalue instead of the motion vector of the current block, encodingefficiency may be improved; however, the motion vector of the currentblock is not exactly the same as the motion vector of the neighboringblock, and there is a big error if the motion vector of the neighboringblock is directly used to replace the motion vector of the currentblock; consequently, accuracy of motion compensation decreases, imagecompression is affected, and encoding quality of an image deteriorates.

Although the encoder may encode a motion vector difference existingbetween the motion vector of the current block itself and the motionvector of the optimal integrated candidate block in a case where themotion vector of the current block is replaced with the motion vector ofthe optimal integrated candidate block, so as to correct the motionvector of the current block on the decoder, the encoded motion vectordifference may occupy a large number of bits in an encoded bit stream,which affects encoding efficiency. Although the encoder may also encodea residual between the motion vector of the optimal integrated candidateblock for the current block and an original value of the current block,transmit the residual to the decoder for correction, because theresidual between the motion vector of the optimal integrated candidateblock for the current block and the original value of the current blockis large, the residual may occupy a large number of bits in the encodedbit stream, which also affects encoding efficiency.

SUMMARY

Embodiments of the present invention provide an encoding/decodingmethod, an encoding apparatus, a decoding apparatus, and a system for avideo image, which may effectively reduce a predication deviation causedbecause a motion vector of a current block and a motion vector of anoptimal integrated candidate block are not exactly the same when amotion vector integration technology is used, so that a predicted valuefor the current block is more accurate and a residual is smaller,thereby improving encoding efficiency and avoiding an increase in databandwidth.

An embodiment of the present invention provides an encoding method for avideo image, including: determining an optimal integrated candidateblock for a current block based on a motion vector integrationtechnology; determining, based a prediction direction of the optimalintegrated candidate block, a motion vector derivation mode that needsto be used by a decoder; correcting a motion vector of the current blockbased on the motion vector derivation mode; and determining a residualbetween a predicted value and an original value of the current blockbased on the corrected motion vector, thereby encoding the currentblock.

An embodiment of the present invention provides a decoding method for avideo image, including: determining an optimal integrated candidateblock for a current block based on an encoded bit stream obtained byusing a motion vector integration technology; when a motion vectorderivation mode needs to be used, determining, based on a predictiondirection of the optimal integrated candidate block, the motion vectorderivation mode that needs to be used; correcting a motion vector of thecurrent block based on the motion vector derivation mode; and decodingthe current block based on the corrected motion vector and a residualcarried in the encoded bit stream.

An embodiment of the present invention provides an encoding apparatusfor a video image, including: a first determining module, configured todetermine an optimal integrated candidate block for a current blockbased on a motion vector integration technology; a second determiningmodule, configured to determine, based a prediction direction of theoptimal integrated candidate block, a motion vector derivation mode thatneeds to be used by a decoder; a correcting module, configured tocorrect a motion vector of the current block based on the motion vectorderivation mode; and an encoding module, configured to determine aresidual between a predicted value and an original value of the currentblock based on the corrected motion vector, thereby encoding the currentblock.

An embodiment of the present invention provides a decoding apparatus fora video image, including: a first determining module, configured todetermine an optimal integrated candidate block for a current blockbased on an encoded bit stream obtained by using a motion vectorintegration technology; a second determining module, configured todetermine, based on a prediction direction of the optimal integratedcandidate block, a motion vector derivation mode that needs to be usedwhen the motion vector derivation mode needs to be used; a correctingmodule, configured to correct a motion vector of the current block basedon the motion vector derivation mode; and a decoding module, configuredto decode the current block based on the corrected motion vector and aresidual carried in the encoded bit stream.

An embodiment of the present invention further provides a system for avideo image, where the system includes an encoding apparatus and adecoding apparatus. The encoding apparatus includes: a first determiningmodule, configured to determine an optimal integrated candidate blockfor a current block based on a motion vector integration technology; asecond determining module, configured to determine, based a predictiondirection of the optimal integrated candidate block, a motion vectorderivation mode that needs to be used by a decoder; a correcting module,configured to correct a motion vector of the current block based on themotion vector derivation mode; and an encoding module, configured todetermine a residual between a predicted value and an original value ofthe current block based on the corrected motion vector, thereby encodingthe current block. The decoding apparatus includes: a first determiningmodule, configured to determine an optimal integrated candidate blockfor a current block based on an encoded bit stream obtained by using amotion vector integration technology; a second determining module,configured to determine, based on a prediction direction of the optimalintegrated candidate block, a motion vector derivation mode that needsto be used when the motion vector derivation mode needs to be used; acorrecting module, configured to correct a motion vector of the currentblock based on the motion vector derivation mode; and a decoding module,configured to decode the current block based on the corrected motionvector and a residual carried in the encoded bit stream.

Based on the above technical solutions, a motion vector of a currentblock is determined firstly according to a motion vector of an optimalintegrated candidate block, and a motion vector derivation mode is usedto correct the motion vector of the current block; therefore, the motionvector of the current block is more accurate, a calculated residual issmaller, and the residual occupies a smaller number of bits in anencoded bit stream, thereby improving encoding efficiency and avoidingan increase in data bandwidth. Meanwhile, because the corrected motionvector is used, a predicted value estimated based on the correctedmotion vector is closer to an original value of the current block, andbecause correction is performed by using the residual, a more accuratedecoding result may be obtained, thereby improving image decodingquality.

BRIEF DESCRIPTION OF DRAWINGS

To illustrate the technical solutions in the embodiments of the presentinvention more clearly, the following briefly introduces theaccompanying drawings required for describing the embodiments.Apparently, the accompanying drawings in the following description showmerely some embodiments of the present invention, and a person ofordinary skill in the art may still derive other drawings from theseaccompanying drawings without creative efforts.

FIG. 1 is a flowchart of an encoding method for a video image accordingto an embodiment of the present invention;

FIG. 2 is a flowchart of a method for correcting a motion vectoraccording to an embodiment of the present invention;

FIG. 3 is a schematic diagram of an example of correcting a motionvector according to an embodiment of the present invention;

FIG. 4 is a flowchart of a decoding method for a video image accordingto an embodiment of the present invention;

FIG. 5 is a flowchart of a method for correcting a motion vectoraccording to an embodiment of the present invention;

FIG. 6 is a flowchart of another decoding method for a video imageaccording to an embodiment of the present invention;

FIG. 7 is a structural block diagram of an encoding apparatus for avideo image according to an embodiment of the present invention;

FIG. 8 is a structural block diagram of another encoding apparatus for avideo image according to an embodiment of the present invention;

FIG. 9 is a structural block diagram of a decoding apparatus for a videoimage according to an embodiment of the present invention;

FIG. 10 is a structural block diagram of another decoding apparatus fora video image according to an embodiment of the present invention; and

FIG. 11 is a schematic diagram of a system for a video image accordingto an embodiment of the present invention.

DESCRIPTION OF EMBODIMENTS

The following clearly and completely describes the technical solutionsin the embodiments of the present invention with reference to theaccompanying drawings in the embodiments of the present invention.Apparently, the described embodiments are merely a part rather than allof the embodiments of the present invention. All other embodimentsobtained by a person of ordinary skill in the art based on theembodiments of the present invention without creative efforts shall fallwithin the protection scope of the present invention.

Firstly, an encoding method 100 for a video image according to anembodiment of the present invention is described with reference to FIG.1.

As shown in FIG. 1, the method 100 includes the following: in S110, anoptimal integrated candidate block for a current block is determinedbased on a motion vector integration technology; in S120, a motionvector derivation mode that needs to be used by a decoder is determinedbased a prediction direction of the optimal integrated candidate block;in S130, a motion vector of the current block is corrected based on themotion vector derivation mode; and in S140, a residual between apredicted value and an original value of the current block is determinedbased on the corrected motion vector, thereby encoding the currentblock.

On an encoder, an optimal integrated candidate block may be determinedbased on a motion vector integration technology, thereby determining aninitial motion vector of a current block. Then, according to whether aprediction direction of the optimal integrated candidate block isbi-directional prediction, forward prediction, or backward prediction, amotion vector derivation technology is determined to correct the initialmotion vector of the current block. After the corrected motion vector isobtained, a predicted value of the current block may be determined basedon the motion vector. Because the encoder is aware of an original valueof the current block, a residual between the predicted value and theoriginal value may be calculated, and the residual may be encoded intoan encoded bit stream together with an index value of the optimalintegrated candidate block.

According to the encoding method provided by the embodiment of thepresent invention, a motion vector of a current block is determinedfirstly according to a motion vector of an optimal integrated candidateblock, and a motion vector derivation mode is used to correct the motionvector of the current block; therefore, the motion vector of the currentblock is more accurate, a calculated residual is smaller, and theresidual occupies a smaller number of bits in an encoded bit stream,thereby improving encoding efficiency and avoiding an increase in databandwidth. Meanwhile, because the corrected motion vector is used, apredicted value estimated based on the corrected motion vector is closerto an original value of the current block, and because correction isalso implemented to the residual, a more accurate decoding result may beobtained by a decoder, thereby improving image decoding quality.

The encoding method 100 provided by the embodiment of the presentinvention is described in detail as follows:

In S110, an optimal integrated candidate block for a current block isdetermined based on a motion vector integration technology.

This step is basically the same as the prior art. A motion vector of theoptimal integrated candidate block is a motion vector that is closest toa motion vector of the current block. When the optimal integratedcandidate block is obtained based on the motion vector integrationtechnology, motion information of the optimal integrated candidateblock, for example, a prediction direction, a prediction vector, andreference frame information of the optimal integrated candidate block,may be determined.

In S120, a motion vector derivation mode that needs to be used by adecoder is determined based on the prediction direction of the optimalintegrated candidate block.

When the prediction direction of the optimal integrated candidate blockis bi-directional prediction, the motion vector derivation mode thatneeds to be used by the decoder may be searching two frames, namely, aforward reference frame and a backward reference frame, for referenceblocks of the current block, where positions of the reference blocks inthe two frames vary as the search is performed, and then referenceblocks that meet a specific condition are selected from the foundreference blocks. In the following, this motion vector derivation modeis also referred to as a bi-directional mode.

When the prediction direction of the optimal integrated candidate blockis forward prediction, the motion vector derivation mode that needs tobe used by the decoder may be: fixing a reference block of the currentblock in a forward reference frame by using a candidate motion vector(that is, the motion vector of the optimal integrated candidate block)in the motion information of the optimal integrated candidate block;searching a backward reference frame for a reference block, where aposition of the reference block in the backward reference frame variesas the search is performed; and then selecting a reference block thatmeets a specific condition from the found reference blocks. In thefollowing, this motion vector derivation mode is also referred to as aforward mode.

When the prediction direction of the optimal integrated candidate blockis backward prediction, the motion vector derivation mode that needs tobe used by the decoder may be: fixing a reference block of the currentblock in a backward reference frame by using a candidate motion vectorin the motion information of the optimal integrated candidate block;searching a forward reference frame for a reference block, where aposition of the reference block in the forward reference frame varies asthe search is performed; and then selecting a reference block that meetsa specific condition from the found reference blocks. In the following,this motion vector derivation mode is also referred to as a backwardmode.

The specific condition may refer to a minimum matching error, and isdescribed in detail in the following.

In S130, the motion vector of the current block is corrected based onthe motion vector derivation mode.

Under the motion vector derivation mode determined in S120, a methodcorresponding to the determined motion vector derivation mode is used tocorrect the motion vector of the current block. It may be regarded thatthe three motion vector derivation modes set three schemes about how tocorrect a motion vector. If it is determined to use one of the motionvector derivation modes, the motion vector is corrected under a schemecorresponding to the motion vector derivation mode. For example, if themotion vector derivation mode determined in S120 is a bi-directionalmode, two frames, namely, a forward reference frame and a backwardreference frame, are searched for reference blocks of the current block,where positions of the reference blocks in the two frames vary as thesearch is performed, and then reference blocks that meet a specificcondition are selected from the found reference blocks. The motionvector of the current block is corrected by using the reference blocksthat are found according to this mode.

According to an embodiment of the present invention, under thedetermined motion vector derivation mode, the motion vector of thecurrent block may be corrected according to the motion vector andreference frame information of the optimal integrated candidate block.

In the motion vector integration technology, when the optimal integratedcandidate block of the current block is determined, the motion vector ofthe current block is the motion vector of the optimal integratedcandidate block. Because the motion vector of the current block is notexactly the same as the motion vector of the optimal integratedcandidate block, the motion vector of the current block needs to becorrected. The correction process may use the motion vector andreference frame information of the optimal integrated candidate block.For example, the motion vector of the current block may be corrected bysearching a reference frame of the current block for a reference blockthat meets a specific condition. The specific condition may be a minimummatching error between found reference blocks.

According to an embodiment of the present invention, a method 200 shownin FIG. 2 may be used to correct a motion vector of a current block.

In S210, an initial reference block in a forward reference frame and aninitial reference block in a backward reference frame are determined forthe current block based on a motion vector and reference frameinformation.

In a bi-directional mode, a forward reference frame and a backwardreference frame of a current block may be determined based on referenceframe information of an optimal scheme candidate block. A motion vectorof the current block may be determined based on a motion vector of theoptimal scheme candidate block, thereby determining an initial referenceblock in the forward reference frame and an initial reference block inthe backward reference frame for the current block.

In a forward mode, a forward reference frame of a current block may bedetermined based on reference frame information of an optimal schemecandidate block. An initial reference block in the forward referenceframe may be determined for the current block based on a motion vectorof the optimal scheme candidate block. A backward reference frame of thecurrent block may be a reference frame immediately following a framewhere the current block is located. In the backward reference frame ofthe current block, a position indicated by a zero motion vector is usedas a reference block of the current block. That is, in the backwardreference frame, a block that is located in the same position as thecurrent block is the initial reference block of the current block.

In a backward mode, a backward reference frame of the current block maybe determined based on reference frame information of an optimal schemecandidate block. An initial reference block in the backward referenceframe may be determined for the current block based on a motion vectorof the optimal scheme candidate block. A forward reference frame of thecurrent block may be a reference frame immediately preceding a framewhere the current block is located. In the forward reference frame ofthe current block, a position indicated by a zero motion vector is usedas a reference block of the current block. That is, in the forwardreference frame, a block that is located in the same position as thecurrent block is the initial reference block of the current block.

By using the above modes, the initial reference block in the forwardreference frame and the initial reference block in the backwardreference frame may be determined for the current block.

In S220, sub-pixel neighborhoods of the initial reference blocks aresearched for reference blocks having a minimum matching error.

A sub-pixel neighborhood of a block may be a region that takes anupper-left pixel of the block as a circle center, and a ½ or ¼ pixeldistance as a radius. The sub-pixel neighborhood of an initial referenceblock may be a region that takes a point indicated by the initial motionvector of the current block, that is, a point at an upper-left corner ofthe initial reference block, as a circle center, and a specific pixeldistance as a radius. In the search of reference blocks, the motionvector of the current block is modified so that the indicated pointmoves within the sub-pixel neighborhood, thereby obtaining differentreference blocks. In this case, interpolation needs to be performedbetween relevant pixels, thereby determining a pixel value after theposition of the reference block is moved by a small distance.

When reference blocks having a minimum matching error are determinedwithin sub-pixel neighborhoods, the motion vector is corrected based onsuch reference blocks.

That is, in the bi-directional mode, the sub-pixel neighborhood in theforward reference frame is searched for a reference block; meanwhile,the sub-pixel neighborhood in the backward reference frame is alsosearched for a reference block; and a matching error between two foundreference blocks is calculated. A minimum matching error is selectedfrom all calculated matching errors, where two reference blockscorresponding to the minimum matching error are the reference blocksfound in S220. The matching error may be an absolute error and SAD(sumof absolute differences).

In the forward mode, because the reference block in the forwardreference frame is fixed, only the backward reference frame needs to besearched for a reference block. Each time a reference block is found, amatching error between the reference block and the reference block inthe forward reference frame is calculated. A minimum matching error isselected from all calculated matching errors, where reference blockscorresponding to the minimum matching error are a reference block foundin the backward reference frame and the reference block fixed in theforward reference frame.

In the backward mode, because the reference block in the backwardreference frame is fixed, only the forward reference frame needs to besearched for a reference block. Each time a reference block is found, amatching error between the reference block and the reference block inthe backward reference frame is calculated. A minimum matching error isselected from all calculated matching errors, where reference blockscorresponding to the minimum matching error are a reference block foundin the forward reference frame and the reference block fixed in thebackward reference frame.

How to search for reference blocks having a minimum matching error in abi-directional mode is described by using FIG. 3 as an example. Thoseskilled in the art may easily think that searching for a reference blockin a forward mode and in a backward mode is similar thereto.

In FIG. 3, a frame where a current block CB is located is Fn, a forwardreference frame of the current block is Ff, and a backward referenceframe of the current block is Fb. By performing S210, it is determinedthat initial reference blocks of the current block CB are Rf₀ and Rb₀. Acircle is drawn by taking points at upper-left corners of the initialreference blocks Rf₀ and Rb₀ as circle centers and specific pixeldistances (for example, ¼ pixel distance) as radiuses, thereby obtainingsub-pixel neighborhoods of the initial reference blocks, as shown bycircular parts in FIG. 3. A process of searching for the referenceblocks is constantly changing a motion vector of the current block CB,so that the upper-left corners of the initial reference blocks Rf₀ andRb₀ move within their respective sub-pixel neighborhoods, therebydetermining new reference blocks, for example, reference blocks Rf₁ andRb₁.

It is assumed that a forward motion vector of an optimal integratedcandidate block is (candFMV_x, candFMV_y), and a backward motion vectoris (candBMV_x, candBMV_y). The forward motion vector and the backwardmotion vector are also a forward motion vector and a backward motionvector of the current block, and directions indicated by the motionvectors are upper-left corners of the initial reference blocks, therebydetermining Rf₀ and Rb₀.

In order to search for the reference blocks, the forward motion vectoris shifted by (x, y), and the backward motion vector is shifted by (u,v). The forward motion vector after the change is (candFMV_x+x,candFMV_y+y), and the backward motion vector after the change is(candBMV_x+u, candBMV_y+v). Assuming that, based on the two motionvectors after the change, blocks Rf1 and Rb1 are obtained by moving theupper-left corners, a matching error between blocks Rf1 and Rb1 may bedetermined. The matching error may be a total of absolute values ofdifferences between pixel values of corresponding pixels in the blocksRf₁ and Rb₁. Based on the same mode, the shift of the motion vector ischanged again, thereby determining a matching error between the blocksin such a case. In the same manner, multiple matching errors may becalculated. In these matching errors, a minimum matching error isselected, where a reference block corresponding to the matching errorbetween the reference block and the current block is the finally foundreference block.

In S230, the motion vector of the current block is corrected based onthe found reference blocks.

After the reference blocks having the minimum matching error are found,a change of the motion vector of the current block with respect to thefound reference blocks may be determined, thereby correcting the motionvector of the current block.

For example, in the example shown in FIG. 3, if a matching errorobtained when the upper-left corner of the initial reference block Rf₀is moved by (a, b) and the upper-left corner of the initial referenceblock Rb₀ is moved by (c, d) is minimum, the forward motion vector ofthe current block is corrected into (candFMV_x+a, candFMV_y+b), and thebackward motion vector of the current block is corrected into(candBMV_x+c, candBMV_y+d).

The process of searching for the reference blocks having a minimummatching error also involves correcting the motion vector of the currentblock. The reference blocks having the minimum matching error aresearched for by correcting the motion vector of the current block, andwhen the reference blocks having the minimum matching error are found,it indicates that correcting the motion vector of the current block iscompleted. A final result of the motion vector of the current block isobtained in S230.

In S140, a residual between a predicted value and an original value ofthe current block is determined based on the corrected motion vector,thereby encoding the current block.

The reference blocks of the current block may be determined based on thecorrected motion vector, thereby predicting a predicted value of thecurrent block according to the reference blocks. For example, apredicted value of the current block may be an average value of thereference block in the forward reference frame and the reference blockin the backward reference frame.

Because the original value of the current block is known to the encoder,the encoder can determine the residual between the predicted value andthe original value. The current block is encoded based on the residual,thereby generating an encoded bit stream of the current block.

According to an embodiment of the present invention, if the residual issmaller than an initial residual that is determined based on a motionvector integration technology, the residual and a first instruction flagfor instructing the decoder to implement a motion vector derivation modeare carried in the encoded bit stream.

Based on the motion vector integration technology, the initial residualof the current block may be determined. Because it may be determinedthat the motion vector of the current block is the motion vector of theoptimal integrated candidate block based on the motion vectorintegration technology, a reference block in a reference frame may bedetermined for the current block, where the reference block may be theinitial reference block mentioned above. The predicted value of thecurrent block may be determined by using the initial reference block,thereby determining the residual between the predicted value and theoriginal value of the current block. Because the residual is obtainedwithout correcting the motion vector of the current block, the residualis the initial residual of the current block.

If a residual obtained based on the corrected motion vector is smallerthan the initial residual, it indicates that a bit stream having fewerbits may be generated by using a motion vector derivation mode, and amore accurate prediction result is available; therefore, the encoderneeds to instruct the decoder to implement the motion vector derivationmode, and use the same method as the encoder to correct the motionvector, thereby obtaining the predicted value and correcting thepredicted value by using the residual. Therefore, in such a case, theresidual and the first instruction flag for instructing the decoder toimplement the motion vector derivation mode are carried in the encodedbit stream, so that the decoder implements the motion vector derivationmode according to the first instruction flag.

According to an embodiment of the present invention, if the residual isgreater than or equal to the initial residual, the initial residual anda second instruction flag for instructing the decoder not to implement amotion vector derivation mode are carried in the encoded bit stream.

If the residual is greater than or equal to the initial residual, itindicates that more bits are generated by using the motion vectorderivation mode, and the prediction result is not as good as aprediction result in a case where the motion vector is not corrected.Therefore, it is unnecessary to use the motion vector derivation mode toperform encoding. The initial residual is determined directly accordingto the motion vector that is not corrected, and the decoder isinstructed not to implement the motion vector derivation mode. Thedecoder uses the initial motion vector of the current block to determinea predicted value, and then corrects the predicted value by using theinitial residual. Therefore, in such a case, the initial residual andthe second instruction flag for instructing the decoder not to implementthe motion vector derivation mode are carried in the encoded bit stream,so that according to the second instruction flag, the decoder does notimplement the motion vector derivation mode.

The first instruction flag and the second instruction flag may be twodifferent values of a same instruction flag. For example, the firstinstruction flag refers to a case where a value of the instruction flagis 1, and the second instruction flag refers to a case where the valueof the instruction flag is 0. Those skilled in the art may also thinkthat the first instruction flag and the second instruction flag may alsobe instruction flags that independently occupy different positions of anencoded bit stream.

According to the encoding method provided by the embodiment of thepresent invention, compared with a conventional motion vector derivationtechnology of a decoder, in a process of correcting a motion vector of acurrent block to search for reference blocks having a minimum matchingerror, because a search range is limited to a small range of a sub-pixelneighborhood, and because a reference frame is fixed, calculationcomplexity may be reduced in comparison with searching in a large range.Compared with a conventional motion vector integration technology, themotion vector is corrected so that a predicted value is more accurateand a residual is smaller, thereby facilitating improvement of decodingquality; in addition, the residual occupies a smaller number of bits inan encoded bit stream, thereby improving encoding efficiency andavoiding an increase in data bandwidth.

Next, a decoding method for a video image according to an embodiment ofthe present invention is described with reference to FIG. 4 and FIG. 5.

As shown in FIG. 4, a decoding method 400 includes the following: inS410, an optimal integrated candidate block for a current block isdetermined based on an encoded bit stream obtained by using a motionvector integration technology; in S420, when a motion vector derivationmode needs to be used, the motion vector derivation mode that needs tobe used is determined based on a prediction direction of the optimalintegrated candidate block; in S430, a motion vector of the currentblock is corrected based on the motion vector derivation mode; and inS440, the current block is decoded based on the corrected motion vectorand a residual carried in the encoded bit stream.

On a decoder, after the encoded bit stream based on the motion vectorintegration technology is received from an encoder, the optimalintegrated candidate block may be determined according to an index valuecarried in the encoded bit stream that uses the motion vectorintegration technology, of the optimal integrated candidate block. If itis determined, according to information carried in the encoded bitstream, that a motion vector derivation mode needs to be used, thenaccording to whether a prediction direction of the optimal integratedcandidate block is a bi-directional prediction, a forward prediction, ora backward prediction, a motion vector derivation technology isdetermined to correct an initial motion vector of the current block.After the corrected motion vector is obtained, a predicted value of thecurrent block may be determined based on the motion vector, and aresidual carried in the encoded bit stream is used to correct thepredicted value, thereby decoding the current block.

According to the decoding method provided by the embodiment of thepresent invention, when it is necessary to use the motion vectorderivation mode, the initial motion vector of the current block may becorrected to obtain a more accurate motion vector, thereby obtaining amore accurate predicted value; in addition, the correction is performedby using the residual, image decoding quality may be improved.Meanwhile, although the residual in the bit stream is small, on thebasis that the correction for the motion vector does not affect theprediction for the current block, a residual having a smaller number ofbits may be carried in the bit stream to perform prediction, therebyavoiding an increase in data bandwidth and facilitating improvement ofencoding efficiency.

The decoding method 400 according to the embodiment of the presentinvention is described in detail as follows:

In S410, an optimal integrated candidate block for a current block isdetermined based on an encoded bit stream obtained by using a motionvector integration technology.

The encoded bit stream may be decoded to obtain a flag indicatingwhether a motion vector integration technology is used, and informationabout the optimal integrated candidate block may be obtained by decodingaccording to the flag. If a flag indicating that a motion vectorintegration technology is used is obtained by decoding, it is determinedthat the encoded bit stream uses the motion vector integrationtechnology. In the encoded bit stream based on the motion vectorintegration technology, an index value and a residual of the optimalintegrated candidate block may be carried. Motion information of theoptimal integrated candidate block may be determined by using the indexvalue, where the motion information may include a prediction direction,a motion vector, and reference frame information of the optimalintegrated candidate block.

In S420, when a motion vector derivation mode needs to be used, themotion vector derivation mode that needs to be used is determined basedon the prediction direction of the optimal integrated candidate block.In S430, the motion vector of the current block is corrected based onthe motion vector derivation mode.

When the decoder determines that a motion vector derivation mode needsto be used, whether a bi-directional mode, a forward mode, or a backwardmode needs to be used may be determined according to the predictiondirection, and the motion vector of the current block may be correctedin different cases. Operations of S420 and S430 are basically the sameas S120 and S130 of FIG. 1.

According to an embodiment of the present invention, the motion vectorof the current block may be corrected according to the motion vector andreference frame information of the optimal integrated candidate block.

According to an embodiment of the present invention, a method 500 shownin FIG. 5 may be used to correct the motion vector of the current block.

In S510, an initial reference block in a forward reference frame and aninitial reference block in a backward reference frame are determined forthe current block based on the motion vector and reference frameinformation; in S520, sub-pixel neighborhoods of the initial referenceblocks are searched for reference blocks having a minimum matchingerror; and in S530, the motion vector of the current block is correctedbased on the found reference blocks.

Operations of S510, S520, and S530 are basically the same as S210, S220,and S230 of FIG. 2, and to avoid repetition, details are not describedherein again. According to the above description, the encoder and thedecoder may correct the motion vector based on the same motion vectorderivation process.

In S440, the current block is decoded based on the corrected motionvector and the residual carried in the encoded bit stream.

After the corrected motion vector is obtained, a reference block in theforward reference frame and a reference block in the backward referenceframe may be determined for the current block, and the predicted valueof the current block is determined based on the reference blocks. Forexample, the predicted value may be an average value of the referenceblocks. Then, the predicted value is corrected based on the residualcarried in the encoded bit stream, thereby obtaining a more accuratedecoding value of the current block.

FIG. 6 shows a flowchart of another decoding method 600 according to anembodiment of the present invention.

S610, S620, S630, and S640 in the decoding method 600 are basically thesame as S410, S420, S430, and S440 in the decoding method 400 of FIG. 4.

According to an embodiment of the present invention, the decoding method600 may further include S615 before S620. In S615, based on at least oneof a prediction direction and an instruction flag which is carried in anencoded bit stream and is for instructing the decoder to implement amotion vector derivation mode or not, it is determined whether to use amotion vector derivation mode.

For example, if a prediction direction of an optimal integratedcandidate block is a bi-directional prediction, the decoder determinesthat a motion vector derivation mode needs to be used. If the predictiondirection is a forward prediction or a backward prediction, the decoderdetermines not to use a motion vector derivation mode.

For another example, if an instruction flag for instructing the decoderto implement a motion vector derivation mode is carried in the encodedbit stream, the decoder determines that a motion vector derivation modeneeds to be used. If an instruction flag for instructing the decoder notto implement a motion vector derivation mode is carried in the encodedbit stream, the decoder determines that it is unnecessary to use amotion vector derivation mode.

For a further example, if a prediction direction is a forwardprediction, and an instruction flag for instructing the decoder toimplement a motion vector derivation mode is carried in the encoded bitstream, the decoder determines that a motion vector derivation modeneeds to be used. If the prediction direction is a backward prediction,and an instruction flag for instructing the decoder not to implement amotion vector derivation mode is carried in the encoded bit stream, thedecoder determines that it is unnecessary to use a motion vectorderivation mode.

S620, S630, and S640 are performed when it is determined, in S615, thata motion vector derivation mode needs to be used.

According to the decoding method provided by the embodiment of thepresent invention, compared with a conventional motion vector derivationtechnology of a decoder, in a process of correcting a motion vector of acurrent block to search for reference blocks having a minimum matchingerror, because a search range is limited to a small range of a sub-pixelneighborhood, and because a reference frame is fixed, calculationcomplexity may be reduced in comparison with searching in a large range.Compared with a conventional motion vector integration technology, themotion vector is corrected so that the predicted value is more accurate,and correction is performed by using the residual so that a decodingresult is more accurate, thereby improving decoding quality. Inaddition, although the residual in the bit stream is small, on the basisthat the correction for the motion vector does not affect the predictionfor the current block, a residual having a smaller number of bits may becarried in the bit stream to perform prediction, thereby avoiding anincrease in data bandwidth and facilitating improvement of encodingefficiency.

The encoding method and the decoding method for a video image accordingto the embodiments of the present invention are described above. Thefollowing describes an encoding apparatus for a video image according toan embodiment of the present invention with reference to FIG. 7 and FIG.8, and describes a decoding apparatus for a video image according to anembodiment of the present invention with reference to FIG. 9 and FIG.10.

FIG. 7 is a structural block diagram of an encoding apparatus 700according to an embodiment of the present invention.

The encoding apparatus 700 includes a first determining module 710, asecond determining module 720, a correcting module 730, and an encodingmodule 740. The first determining module 710 may be configured todetermine an optimal integrated candidate block for a current blockbased on a motion vector integration technology. The second determiningmodule 720 may be configured to determine, based on a predictiondirection of the optimal integrated candidate block, a motion vectorderivation mode that needs to be used by a decoder. The correctingmodule 730 may be configured to correct a motion vector of the currentblock based on the motion vector derivation mode. The encoding module740 may be configured to determine a residual between a predicted valueand an original value of the current block based on the corrected motionvector, thereby encoding the current block.

Reference may be made to S110, S120, S130, and S140 in the encodingmethod 100 for the above and other operations and/or functions of thefirst determining module 710, the second determining module 720, thecorrecting module 730, and the encoding module 740 of the encodingapparatus 700, and details are not described herein again to avoidrepetition.

With the encoding apparatus according to the embodiment of the presentinvention, a motion vector of a current block is determined firstlyaccording to a motion vector of an optimal integrated candidate block,and a motion vector derivation mode is used to correct the motion vectorof the current block; therefore, the motion vector of the current blockis more accurate, a calculated residual is smaller, and the residualoccupies a smaller number of bits in an encoded bit stream, therebyimproving encoding efficiency and avoiding an increase in databandwidth. Meanwhile, because the corrected motion vector is used, apredicted value estimated based on the corrected motion vector is closerto an original value of the current block, and because correction isperformed by using the residual, a more accurate decoding result may beobtained by a decoder, thereby improving image decoding quality.

FIG. 8 shows a structural block diagram of another encoding apparatus800 according to an embodiment of the present invention.

A first determining module 810, a second determining module 820, acorrecting module 830, and an encoding module 840 in the encodingapparatus 800 are basically the same as the first determining module710, the second determining module 720, the correcting module 730, andthe encoding module 740 of the encoding apparatus 700 in FIG. 7.

According to an embodiment of the present invention, the correctingmodule 830 may be configured to correct a motion vector of a currentblock according to a motion vector and reference frame information of anoptimal integrated candidate block.

According to an embodiment of the present invention, the correctingmodule 830 may include a first determining unit 832, a searching unit834, and a correcting unit 836. The first determining unit 832 may beconfigured to determine an initial reference block in a forwardreference frame and an initial reference block in a backward referenceframe for the current block based on the motion vector and the referenceframe information. The searching unit 834 may be configured to searchsub-pixel neighborhoods of the initial reference blocks for referenceblocks having a minimum matching error. The correcting unit 836 may beconfigured to correct the motion vector of the current block based onthe found reference blocks.

According to an embodiment of the present invention, the encoding module840 may include a first encoding unit 842. The first encoding unit 842may be configured to, if the residual is smaller than an initialresidual that is determined based on a motion vector integrationtechnology, carry, in an encoded bit stream, the residual and a firstinstruction flag for instructing a decoder to implement a motion vectorderivation mode.

According to an embodiment of the present invention, the encoding module840 may further include a second encoding unit 844. The second encodingunit 844 may be configured to, if the residual is greater than or equalto the initial residual, carry, in the encoded bit stream, the initialresidual and a second instruction flag for instructing the decoder notto implement the motion vector derivation mode.

Reference may be made to S130 and S140 in the encoding method 100 andS210 to S230 in the method 200 for correcting a motion vector for theabove and other operations and/or functions of the correcting module830, the first determining unit 832, the searching unit 834, thecorrecting unit 836, the first encoding unit 842, and the secondencoding unit 844, and details are not described herein again to avoidrepetition.

According to the encoding apparatus provided by the embodiment of thepresent invention, compared with a conventional motion vector derivationtechnology of a decoder, in a process of correcting a motion vector of acurrent block to search for reference blocks having a minimum matchingerror, because a search range is limited to a small range of a sub-pixelneighborhood, and because a reference frame is fixed, calculationcomplexity may be reduced in comparison with searching in a large range.Compared with a conventional motion vector integration technology, themotion vector is corrected so that a predicted value is more accurateand a residual is smaller, thereby facilitating improvement of decodingquality; in addition, the residual occupies a smaller number of bits inan encoded bit stream, thereby improving encoding efficiency andavoiding an increase in data bandwidth.

FIG. 9 shows a structural block diagram of a decoding apparatus 900according to an embodiment of the present invention.

The decoding apparatus 900 includes a first determining module 910, asecond determining module 920, a correcting module 930, and a decodingmodule 940. The first determining module 910 may be configured todetermine an optimal integrated candidate block for a current blockbased on an encoded bit stream obtained by using a motion vectorintegration technology. The second determining module 920 may beconfigured to, when a motion vector derivation mode needs to be used,determine, based on a prediction direction of the optimal integratedcandidate block, the motion vector derivation mode that needs to beused. The correcting module 930 may be configured to correct a motionvector of the current block based on the motion vector derivation mode.The decoding module 940 may be configured to decode the current blockbased on the corrected motion vector and a residual carried in theencoded bit stream.

Reference may be made to S410, S420, S430, and S440 in the decodingmethod 400 for the above and other operations and/or functions of thefirst determining module 910, the second determining module 920, thecorrecting module 930, and the decoding module 940 of the decodingapparatus 900, and details are not described herein again to avoidrepetition.

With the decoding apparatus according to the embodiment of the presentinvention, when it is necessary to use a motion vector derivation mode,a more accurate motion vector may be obtained by correcting an initialmotion vector of the current block, thereby obtaining a more accuratepredicted value; in addition, the correction is performed by using theresidual, image decoding quality may be improved. Meanwhile, althoughthe residual in the bit stream is small, on the basis that thecorrection for the motion vector does not affect the prediction for thecurrent block, a residual having a smaller number of bits may be carriedin the bit stream to perform prediction, thereby avoiding an increase indata bandwidth and facilitating improvement of encoding efficiency.

FIG. 10 shows a structural block diagram of another decoding apparatus1100 according to an embodiment of the present invention.

A first determining module 1010, a second determining module 1020, acorrecting module 1030, and a decoding module 1040 in the decodingapparatus 1000 are basically the same as the first determining module910, the second determining module 920, the correcting module 930, andthe decoding module 940 of the decoding apparatus 900 in FIG. 9.

According to an embodiment of the present invention, the correctingmodule 1030 may be configured to correct a motion vector of a currentblock according to a motion vector and reference frame information of anoptimal integrated candidate block.

According to an embodiment of the present invention, the correctingmodule 1030 may include a first determining unit 1032, a searching unit1034, and a correcting unit 1036. The first determining unit 1032 may beconfigured to determine an initial reference block in a forwardreference frame and an initial reference block in a backward referenceframe for the current block based on the motion vector and the referenceframe information. The searching unit 1034 may be configured to searchsub-pixel neighborhoods of the initial reference blocks for referenceblocks having a minimum matching error. The correcting unit 1036 may beconfigured to correct the motion vector of the current block based onthe found reference blocks.

According to an embodiment of the present invention, the decoding module1000 may further include a third determining module 1050. The thirddetermining module 1050 may be configured to determine, based on atleast one of a prediction direction and an instruction flag which iscarried in an encoded bit stream and is for instructing a decoder toimplement the motion vector derivation mode or not, whether to use amotion vector derivation mode. When the third determining module 1050determines that the motion vector derivation mode needs to be used, thesecond determining module 1020, the correcting module 1030, and thedecoding module 1040 need to complete corresponding operations.

Reference may be made to S430 in the decoding method 400, S510 to S530in the method 500 for correcting a motion vector, and S615 in thedecoding method 600 for the above and other operations and/or functionsof the correcting module 1030, the first determining unit 1032, thesearching unit 1034, the correcting unit 1036, and the third determiningmodule 1050, and details are not described herein again to avoidrepetition.

According to the decoding apparatus provided by the embodiment of thepresent invention, compared with a conventional motion vector derivationtechnology of a decoder, in a process of correcting a motion vector of acurrent block to search for reference blocks having a minimum matchingerror, because a search range is limited to a small range of a sub-pixelneighborhood, and because a reference frame is fixed, calculationcomplexity may be reduced in comparison with searching in a large range.Compared with a conventional motion vector integration technology, themotion vector is corrected so that the predicted value is more accurate,and correction is performed by using the residual so that a decodingresult is more accurate, thereby improving decoding quality. Inaddition, although the residual in the bit stream is small, on the basisthat the correction for the motion vector does not affect the predictionfor the current block, a residual having a smaller number of bits may becarried in the bit stream to perform prediction, thereby avoiding anincrease in data bandwidth and facilitating improvement of encodingefficiency.

Next, a system 1100 for a video image according to an embodiment of thepresent invention is described with reference to FIG. 11. The system1100 may be used to encode and decode a video image.

The system 1100 includes an encoding apparatus 1110 and a decodingapparatus 1120.

The encoding apparatus 1110 may be the encoding apparatus 700 shown inFIG. 7, and the decoding apparatus 1120 may be the decoding apparatus900 shown in FIG. 9.

The encoding apparatus 1110 may further include the correcting module830, the first determining unit 832, the searching unit 834, thecorrecting unit 836, the first encoding unit 842, and the secondencoding unit 844 shown in FIG. 8. The decoding apparatus 1120 mayfurther include the correcting module 1030, the first determining unit1032, the searching unit 1034, and the correcting unit 1036 shown inFIG. 10.

According to the system for encoding and decoding a video image providedby the embodiment of the present invention, a motion vector in a motionvector integration technology is corrected by using a motion vectorderivation technology, so that a more accurate predicted value of acurrent block may be obtained, thereby reducing a residual between thepredicted value and an original value, and an encoder is allowed toencode the residual into an encoded bit stream by using a small numberof bits, thereby improving encoding efficiency and avoiding an increasein data bandwidth. Meanwhile, because the corrected motion vector isused, a predicted value estimated based on the corrected motion vectoris closer to an original value of the current block, and becausecorrection is performed by using the residual, a decoder may obtain amore accurate decoding result, thereby improving image decoding quality.

According to the system provided by the embodiment of the presentinvention, compared with a conventional motion vector derivationtechnology of a decoder, in a process of correcting a motion vector of acurrent block to search for reference blocks having a minimum matchingerror, because a search range is limited to a small range of a sub-pixelneighborhood, calculation complexity may be reduced in comparison withsearching in a large range. Compared with a conventional motion vectorintegration technology, the motion vector is corrected so that apredicted value is more accurate and a residual is smaller, therebyfacilitating improvement of decoding quality; in addition, the residualoccupies a smaller number of bits in an encoded bit stream, therebyimproving encoding efficiency and avoiding an increase in databandwidth.

Those skilled in the art may note that, the method steps and units withreference to the description in the embodiments disclosed herein may beimplemented by using electronic hardware, computer software, or acombination thereof, and in order to clearly describe interchangeabilityof hardware and software, the above description describes steps andcombinations of various embodiments according to general functions.These functions may be executed in a mode of hardware or softwaredepending on the specific application and design constraint condition ofthe technical solutions. Those skilled in the art may use differentmethods for each specific application to implement the describedfunctions; however, the implementation shall not be regarded asexceeding the scope of the present invention.

The technology according to the embodiments of the present invention maybe applied in a digital signal processing field, and may be implementedby using an encoder and a decoder. A video encoder and a video decoderare widely used in various communication devices or electronic devices,such as a digital television, set-top box, media gateway, mobile phone,wireless apparatus, personal digital assistant (PDA), hand-held orportable computer, GPS receiver/navigator, camera, video player, videocamera, video recorder, monitoring device, videoconferencing andvideophone device. These devices each include a processor, a storage,and an interface for transferring data. The video encoder and decodermay be implemented directly by using a digital circuit or chip, such asa DSP (digital signal processor), or may be implemented by using asoftware code to drive a processor to execute a procedure in thesoftware code.

The method steps with reference to the description of the embodimentsdisclosed herein may be implemented by using hardware, a softwareprogram to be executed by a processor, or a combination thereof. Thesoftware program may be placed in a random access memory (RAM), amemory, a read-only memory (ROM), an electrically programmable ROM, andan electrically erasable programmable ROM, a register, a hard disk, aremovable magnetic disk, a CD-ROM, or the storage media of any otherforms well-known in the art.

Although some embodiments of the present invention are shown anddescribed, those skilled in the art shall understand that variousmodification may be made to the embodiments without departing from theprinciple and idea of the present invention, and such modificationsshall fall within the scope of the present invention.

What is claimed is:
 1. An encoding method for a video image, comprising: determining an optimal integrated candidate block for a current block based on a motion vector integration technology; determining, based on a prediction direction of the optimal integrated candidate block, whether a motion vector derivation mode needs to be used by a decoder; upon determining that the motion vector derivation mode needs to be used by the decoder, correcting a motion vector of the current block based on the motion vector derivation mode; and determining a residual between a predicted value of the current block and an original value of the current block based on the corrected motion vector, thereby encoding the current block, wherein: the motion vector integration technology uses a motion vector of the optimal integrated candidate block as the motion vector for the current block based on a motion vector correlation between the optimal integrated candidate block and the current block, and the predicted value of the current block is based on the corrected motion vector.
 2. The encoding method according to claim 1, wherein the correcting a motion vector of the current block comprises: correcting the motion vector of the current block according to a motion vector and reference frame information of the optimal integrated candidate block.
 3. The encoding method according to claim 2, wherein the correcting a motion vector of the current block comprises: determining an initial reference block in a forward reference frame and an initial reference block in a backward reference frame for the current block based on the motion vector and the reference frame information; searching sub-pixel neighborhoods of the initial reference blocks for reference blocks having a minimum matching error; and correcting the motion vector of the current block based on the found reference blocks.
 4. The encoding method according to claim 1, wherein the encoding the current block comprises: upon determining the residual is smaller than an initial residual that is determined based on the motion vector integration technology, carrying, in an encoded bit stream, the residual and a first instruction flag for instructing the decoder to use the motion vector derivation mode.
 5. The encoding method according to claim 4, wherein the encoding the current block further comprises: upon determining the residual is greater than or equal to the initial residual, carrying, in the encoded bit stream, the initial residual and a second instruction flag for instructing the decoder not to use the motion vector derivation mode.
 6. A decoding method for a video image, comprising: determining an optimal integrated candidate block for a current block based on an encoded bit stream obtained by using a motion vector integration technology; upon determining a motion vector derivation mode needs to be used, determining, based on a prediction direction of the optimal integrated candidate block, the motion vector derivation mode that needs to be used; upon determining the motion vector derivation mode need to be used, correcting a motion vector of the current block based on the determined motion vector derivation mode; and decoding the current block based on the corrected motion vector and a residual carried in the encoded bit stream, wherein the motion vector integration technology uses a motion vector of the optimal integrated candidate block as the motion vector for the current block based on a motion vector correlation between the optimal integrated candidate block and the current block.
 7. The decoding method according to claim 6, wherein the correcting a motion vector of the current block comprises: correcting the motion vector of the current block according to a motion vector and reference frame information of the optimal integrated candidate block.
 8. The decoding method according to claim 7, wherein the correcting a motion vector of the current block comprises: determining an initial reference block in a forward reference frame and an initial reference block in a backward reference frame for the current block based on the motion vector and the reference frame information; searching sub-pixel neighborhoods of the initial reference blocks for reference blocks having a minimum matching error; and correcting the motion vector of the current block based on the found reference blocks.
 9. The decoding method according to claim 6, wherein before determining the motion vector derivation mode that needs to be used, the method further comprises: based on one or a combination of the prediction direction and an instruction flag which is carried in the encoded bit stream and is for instructing a decoder to use the motion vector derivation mode or not, determining whether to use the motion vector derivation mode.
 10. An encoding apparatus for a video image, comprising: at least one hardware processor; a memory including instructions to control the at least one hardware processor to: determine an optimal integrated candidate block for a current block based on a motion vector integration technology; determine, based on a prediction direction of the optimal integrated candidate block, whether a motion vector derivation mode needs to be used by a decoder; upon determining that the motion vector derivation mode needs to be used by the decoder, correct a motion vector of the current block based on the motion vector derivation mode; and determine a residual between a predicted value of the current block and an original value of the current block based on the corrected motion vector, thereby encoding the current block, wherein: the motion vector integration technology uses a motion vector of the optimal integrated candidate block as the motion vector for the current block based on a motion vector correlation between the optimal integrated candidate block and the current block, and the predicted value of the current block is based on the corrected motion vector.
 11. The encoding apparatus according to claim 10, wherein the instruction further control the at least one hardware processor to correct the motion vector of the current block according to a motion vector and reference frame information of the optimal integrated candidate block.
 12. The encoding apparatus according to claim 11, wherein the instruction further control the at least one hardware processor to: determine an initial reference block in a forward reference frame and an initial reference block in a backward reference frame for the current block based on the motion vector and the reference frame information; search sub-pixel neighborhoods of the initial reference blocks for reference blocks having a minimum matching error; and correct the motion vector of the current block based on the found reference blocks.
 13. The encoding apparatus according to claim 10, wherein the instruction further control the at least one hardware processor to: upon determining the residual is smaller than an initial residual that is determined based on the motion vector integration technology, carry, in an encoded bit stream, the residual and a first instruction flag for instructing the decoder to use the motion vector derivation mode.
 14. The encoding apparatus according to claim 13, wherein the instruction further control the at least one hardware processor to: upon determining the residual is greater than or equal to the initial residual, carry, in the encoded bit stream, the initial residual and a second instruction flag for instructing the decoder not to use the motion vector derivation mode.
 15. A decoding apparatus for a video image, comprising: at least one hardware processor; a memory including instructions to control the at least one hardware processor to: determine an optimal integrated candidate block for a current block based on an encoded bit stream obtained by using a motion vector integration technology, the motion vector integration technology using a motion vector of the optimal integrated candidate block as a motion vector for the current block based on a motion vector correlation between the optimal integrated candidate block and the current block; when a motion vector derivation mode needs to be used, determine, based on a prediction direction of the optimal integrated candidate block, the motion vector derivation mode that needs to be used; correct the motion vector of the current block based on the determined motion vector derivation mode; and decode the current block based on the corrected motion vector and a residual carried in the encoded bit stream.
 16. The decoding apparatus according to claim 15, wherein the instructions further control the at least one hardware processor to correct the motion vector of the current block according to a motion vector and reference frame information of the optimal integrated candidate block.
 17. The decoding apparatus according to claim 16, wherein the instructions further control the at least one hardware processor to: determine an initial reference block in a forward reference frame and an initial reference block in a backward reference frame for the current block based on the motion vector and the reference frame information; search sub-pixel neighborhoods of the initial reference blocks for reference blocks having a minimum matching error; and correct the motion vector of the current block based on the found reference blocks.
 18. The decoding apparatus according to claim 15, wherein the instructions further control the at least one hardware processor to: based on one or a combination of the prediction direction and an instruction flag which is carried in the encoded bit stream and is for instructing a decoder to use the motion vector derivation mode or not, determine whether to use the motion vector derivation mode.
 19. A system for a video image, comprising: an encoding apparatus for a video image, comprising: at least one hardware processor; a memory including instructions to control the at least one hardware processor to: determine an optimal integrated candidate block for a current block based on a motion vector integration technology, the motion vector integration technology using a motion vector of the optimal integrated candidate block as the motion vector for the current block based on a motion vector correlation between the optimal integrated candidate block and the current block; to determine, based on a prediction direction of the optimal integrated candidate block, whether a motion vector derivation mode needs to be used by a decoder; upon determining that the motion vector derivation mode needs to be used by the decoder, correct a motion vector of the current block based on the motion vector derivation mode; and determine a residual between a predicted value of the current block and an original value of the current block based on the corrected motion vector, thereby encoding the current block, wherein the predicted value of the current block is based on the corrected motion vector; and an decoding apparatus for a video image, comprising: at least one other hardware processor; a memory including instructions to control the at least one other hardware processor to implement: determine an optimal integrated candidate block for a current block based on an encoded bit stream obtained by using a motion vector integration technology; when a motion vector derivation mode needs to be used, determine, based on a prediction direction of the optimal integrated candidate block, the motion vector derivation mode that needs to be used; correct a motion vector of the current block based on the determined motion vector derivation mode; and decode the current block based on the corrected motion vector and a residual carried in the encoded bit stream. 